(12) INTERNATIONAL APPLICATION PUBLISHED UNDER 



THE PATENT COOPERATION TREATY (PCT) 



(19) World Intellectual Property Organization 
International Bureau 

(43) International Publication Date 
9 August 2001 (09.08*2001) 





HUH 






PCT 



(10) International Publication Number 

WO 01/57249 Al 



(51) International Patent Classification 7 : 
C07H 21/00, B01J 19/00 



C12Q 1/68, 



(21) International Application Number: PCT/GB0 1/00421 

(22) International Filing Date: 1 Febniary 2001 (01.02.2001) 



(25) Filing Language: 

(26) Publication Language: 



English 
English 



(30) Priority Data: 

0002389.5 



2 February 2000 (02.02.2000) GB 



(71) Applicant (for all designated States except US): SOLEXA 
LTD. [GB/GB]; Chesterton! Research Park, Little Chester- 
ford, Nr. Saffron Walden, Essex CB10 1XL (GB). 

(72) Inventor; and 

(75) Inventor/Applicant (for US only): BARNES, Colin 
[GB/GBj; Solexa Ltd., Chesterford Research Park, Little 
Chesterford, Nr. Saffron Walden, Essex CB 10 1XL (GB). 

(74) Agent: GILL JENNINGS & EVERY; Broadgate House, 
7 Eldon Street, London EC2M 7LH (GB). 



(81) Designated States (national): AE, AG, AL, AM, AT, AU, 
AZ, BA, BB, BG, BR, BY, BZ, CA, CH, CN, CR, CU, CZ, 
DE, DK, DM, DZ, EE, ES, FI, GB, GD, GE, GH, GM, HR, 
HU, ID, IL, IN, IS, JP. KE, KG, KP, KR, KZ, LC, LK, LR, 
LS, LT, LU, LV, MA, MD, MG, MK, MN. MW, MX, MZ, 
NO, NZ, PL, PT, RO, RU, SD, SE, SG, SI, SK, SL, TJ, TM, 
TR, TT, TZ, UA, UG, US, UZ, VN, YU, ZA, ZW. 

(84) Designated States (regional): ARIPO patent (GH, GM, 
KE, LS, MW, MZ, SD, SL, SZ, TZ, UG, ZW), Eurasian 
patent (AM, AZ, BY, KG, KZ, MD, RU, TJ, TM), European 
patent (AT, BE, CH, CY, DE, DK, ES, FI, FR, GB, GR, IE, 
IT, LU, MC, NL, PT, SE, TR), OAPI patent (BF, BJ, CF, 
CG, CI, CM, GA, GN, GW, ML, MR, NE, SN, TD, TG). 

Published: 

— with international search report 

— before the expiration of the time limit for amending the 
claims and to be republished in the event of receipt of 
amendments 

For two-letter codes and other abbreviations, refer to the "Guid- 
ance Notes on Codes and Abbreviations" appearing at the begin- 
ning of each regular issue of the PCT Gazette. 



< 

OS 



(54) Title: SYNTHESIS OF SPATIALLY ADDRESSED MOLECULAR ARRAYS 



Q (57) Abstract: A method for synthesizing a spatially addressed array of polymers immobilised on a solid surface is disclosed, 
wherein the array has a surface density which allows each polymer to be individually resolved, e.g. by optical microscopy. There- 
^ fore, the arrays of the present invention consist of single polymers that are more spatially distinct than the array of the prior an. 



WO 01/57249 



PCT/GB01/00421 



SYNTHESIS OF SPATIALLY ADDRESSED MOLECULAR ARRAYS 
Field of the Invention 

This invention relates to fabricated arrays of polymers. In particular, this 
invention relates to the production of spatially addressed polymer arrays. 
5 Background of the Invention 

Advances in the study of molecules have been led, in part, by 
improvement in technologies used to characterise the molecules or their 
biological reactions. In particular, the study of nucleic acid, DNA and RNA, has 
benefitted from developing technologies used for sequence analysis and the 
10 study of hybridisation events. 

An example of the technologies that have improved the study of nucleic 
acids, is the development of fabricated arrays of immobilised nucleic acids. 
These arrays typically consist of a high-density matrix of polynucleotides 
immobilised onto a solid support material. Fodor et a/., Trends in Biotechnology 
15 (1994) 12:19-26, describes ways of assembling the nucleic acid arrays using 
a chemically sensitised glass surface protected by a mask, but exposed at 
defined areas to allow attachment of suitably modified nucleotides. 

An alternative approach is described by Schena et a/., Science (1995) 
270:467-470, where samples of DNA are positioned at predetermined sites on 
20 a glass microscope slide by robotic micropipetting techniques. The DNA is 
attached to the glass surface through its entire length by non-covalent 
electrostatic interactions. 

The arrays are usually provided to study hybridisation events, determine 
the sequence of DNA (Mirzabekov, Trends in Biotechnology (1994) 12:27-32) 
25 or to detect mutations in a particular DNA sample. Many of these hybridisation 
events are detected using fluorescent labels attached to nucleotides with 
fluorescence detected using sensitive fluorescent detector, e.g. charge 
coupled detector (CCD). However, the major disadvantages of these methods 
are that it is not possible to sequence long stretches of DNA and repeat 
30 sequences can lead to ambiguity in the results. These problems are 
recognised in Automation Technologies for Genome Characterisation, Wiley- 
Interscience, 1997, Ed. T. J. Beugelsdijk, Chapter 10: 205-225. 
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In addition, the use of multi-molecule high-density arrays in a multi-step 
analysis procedure can lead to problems with phasing. Phasing problems 
result from a loss in the synchronisation of a reaction step occurring on different 
molecules of the array. If a proportion of the arrayed molecules fails to undergo 
5 a step in the procedure, subsequent results obtained for these molecules will 
no longer be in-step with results obtained for the other arrayed molecules. The 
proportion of molecules out of phase will increase through successive steps 
and consequently the results detected will become ambiguous. This problem 
is recognised in the sequencing procedure described in US-A-5302509. 
10 Summary of the Invention 

According to the present invention, a method for forming a spatially 
addressable array of polymers immobilised on a solid support comprises the 
steps of: 

(i) contacting an array of single molecules with one or more 
15 detectably labelled monomers, under conditions that permit 

incorporation of a monomer onto a molecule of the array, wherein 
the labelled monomer comprises a removable blocking group that 
prevents further monomer incorporation occurring; 

(ii) removing non-incorporated monomers and detecting the label on 
20 the incorporated monomer; 

(iii) removing the blocking group and any separate label; and 

(iv) optionally repeating steps (i) - (iii) to form a single polymer of 
defined sequence; 

wherein the array has a surface density which allows each polymer to be 

2 5 individually resolved by optical microscopy. 

According to the present invention, high-density single polymer arrays 
are synthesised in a manner that permits the sequence of each polymer to be 
determined. As the sequence for each polymer is known, the result of the 
synthesis is a spatially addressed array. Further, the random addition of 

3 o monomers to the growing polymer strands in the synthesis procedure allows a 

vast diversity of different polymers to be formed. 
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The formation of spatially addressed high-density arrays has many 
important benefits for the study of the single polymer molecules and their 
interactions with other biological molecules. The arrays are particularly suitable 
for DNA analysis procedures using hybridisation-based approaches. Knowing 
5 the sequence of polynucleotides (polymers) on the array enables the user to 
quickly determine the sequence of a complementary polynucleotide hybridised 
thereto. 

Description of the Invention 

The present invention relates to the formation of single molecule polymer 

10 arrays using a step-wise synthesis procedure, whereby the identity of each 
monomer is determined at each incorporation step. 

The term "single molecule" and "single polymer" is used herein to 
distinguish from high-density, multi-molecule arrays in the prior art, which may 
comprise distinct clusters of many molecules of the same type. 

15 The term "individually resolved" is used herein to indicate that, when 

visualised, it is possible to distinguish one polymer on the array from its 
neighbouring polymers. Visualisation may be effected by the use of reporter 
labels, e.g. fluorophores, the signal of which is individually resolved. The 
requirement for individual resolution ensures that individual monomer 

2 o incorporation can be detected at each synthesis step. 

In general, the method may be carried out using conventional synthesis 
techniques which utilise the step-wise incorporation of monomers onto a 
growing polymer strand. 

The synthesised polymers may be of any biomolecule or organic 
25 molecule, including peptides and polypeptides. The polymers are preferably 
polynucleotides, e.g. DNA or RNA, and the monomers for incorporation may be 
the bases adenine (A), thymine (T), guanine (G) and cytidine (C). Uracil (U) 
may also be used. 

The monomers should be detectably-labeled and include a blocking 

3 o group to prevent incorporation of further monomers until after the detection step 

has been carried out. In one preferred embodiment, the label is, or is part of, 
the blocking group, and can be removed under defined conditions. Different 
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monomer types will usually be labeled with a distinct label. For example, in the 
context of DNA synthesis, each monomer base will have a specific label which 
characterises the base. This enables the stepwise incorporation of monomers 
to be monitored during the synthesis procedure. 
5 Preparation of monomers with suitable labels and blocking groups will 

be apparent to the skilled person. For DNA, conventional phosphoramidite 
chemistries may be used. The label (fluorophore) may be located on a 
protecting group or may be located at a separate position. A skilled person will 
appreciate that cleavable linker groups can be readily prepared, as in 

10 US-A-5302509. 

Suitable labels will also be apparent to the skilled person. In a preferred 
embodiment, the label is a fluorophore. Alternative labels may be used. A 
number of strategies for labelling molecules of DNA have been reported, such 
as microspheres (Anal. Chem. (2000) 72, 15: 3678-3681), gold nanoparticles 

15 (J. Am. Chem. Soc, (2000) 122, 15: 3795-3796), silver colloid particles (PNAS, 
(2000) 97, 3: 996-1 001 ) and quantum dots. Any labelling technique that allows 
unambiguous identification of the incorporated moiety can be utilised in this 
scheme. 

The first step in the synthesis procedure will be to form an array of single 
2 o molecules, onto which the monomers are to be incorporated. Immobilisation of 
the single molecules to the surface of a solid support may be carried out by any 
known technique. Generally the array is produced by dispensing small volumes 
of a sample onto a suitably prepared solid surface, or by applying a dilute 
solution to the solid surface to generate a random array. Immobilisation may 

2 5 occur by covalent or non-covalent interactions. 

The single molecules may themselves be monomers, prepared so that 
immobilisation with the solid support can occur. If the molecule is a monomer 
base, immobilisation will preferably occur at the 3 f -position to permit 
incorporation at the ^-position. Various linker molecules, e.g. polyethylene 

3 o glycol, may also be present. Further details of the preparation of these single 

molecule arrays is disclosed in WO-A-00/06770. 
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If the polymer is a polynucleotide, synthesis may be carried out by the 
use of conventional soi id-phase DNA synthesis techniques, e.g. using 
phosphoramidite chemistry, as disclosed in "Nucleic Acids in Chemistry and 
Biology" by Blackburn & Gait, Oxford University Press, pages 118-137, 
5 Tetrahedron Letters (1 990) 31 49: 7095-7098, and Tetrahedron Letters (2000) 
56: 271 3-2724. If a fluorescently-modified 5'-protecting group is used with the 
phosphoramidite, then the deprotection and removal of the fluorescent label 
can be carried out in a single step after each round of synthesis. Each round 
of synthesis may comprise one or more different monomers, e.g. the bases G, 

io C, A and T. The array may be synthesised randomly by incorporating all the 
different monomers during each round of synthesis, or in a more controlled 
fashion, using only one distinct monomer in each round of synthesis. 

The density of the arrays is not critical. However, the present invention 
can make use of a high-density of single polymer molecules, and these are 

1 5 preferable. For example, arrays with a density of 1 0 6 -1 0 9 polymers per cm 2 may 
be used. Preferably, the density is at least 10 7 /cm 2 and typically up to 10 8 /cm 2 . 
These high-density arrays are in contrast to other arrays which may be 
described in the art as "high-density" but which are not necessarily as high 
and/or which do not allow single molecule resolution. 

20 The extent of separation between the individual polymers on the array 

will be determined, in part, by the particular technique used to resolve the 
individual polymer molecule. Apparatus used to image molecular arrays are 
known to those skilled in the art. For example, a confocal scanning microscope 
may be used to scan the surface of the array with a laser to image directly a 

25 fluorophore incorporated on the individual polymer by fluorescence. 
Alternatively, a sensitive 2-D detector, such as a charge-coupled detector, can 
be used to provide a 2-D image representing the individual polymers on the 
array. 

Resolving single polymer molecules on the array with a 2-D detector can 
30 be done if, at 100 x magnification, adjacent polymers are separated by a 
distance of approximately at least 250nm, preferably at least 300nm and more 
preferably at least 350nm. It will be appreciated that these distances are 
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dependent on magnification, and that other values can be determined 
accordingly, by one of ordinary skill in the art. 

Other techniques such as scanning near-field optical microscopy 
(SNOM) are available which are capable of greater optical resolution, thereby 
5 permitting more dense arrays to be used. For example, using SNOM, adjacent 
polymers may be separated by a distance of less than 100nm, e.g. 10nm. For 
a description of scanning near-field optical microscopy, see Moyeref a/., Laser 
Focus World (1993) 29(10). 

An additional technique that may be used is surface-specific total 
10 internal reflection fluorescence microscopy (TIRFM); see, for example, Vale et 
a/. , Nature, (1 996) 380: 451 -453). Using this technique, it is possible to achieve 
wide-field imaging (up to 100 pm x 100 \im) with single polymer molecule 
sensitivity. This may allow arrays of greater than 1 0 7 resolvable polymers per 
cm 2 to be used. 

15 Additionally, the techniques of scanning tunnelling microscopy (Binnig 

et a/., Helvetica Physica Acta (1982) 55:726-735) and atomic force microscopy 
(Hansma et a/., Ann. Rev. Biophys. Biomol. Struct. (1994) 23:115-139) are 
suitable for imaging the arrays of the present invention. Other devices which 
do not rely on microscopy may also be used, provided that they are capable of 

20 imaging within discrete areas on a solid support. 

Suitable solid supports are available commercially, and will be apparent 
to the skilled person. The supports may be manufactured from materials such 
as glass, ceramics, silica and silicon. The supports usually comprise a fiat 
(planar) surface, or at least an array in which the polymers are in the same 

25 plane. Any suitable size may be used. For example, the supports might be of 
the order of 1-10 cm in each direction. 

It is important to prepare the solid support under conditions which 
minimise or avoid the presence of contaminants. The solid support must be 
cleaned thoroughly, preferably with a suitable detergent, e.g. Decon-90, to 

3 o remove dust and other contaminants. 

Because the array consists of optically resolvable polymers, the 
synthesis of each target polymer will generate a series of distinct signals as the 
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fluorescent events are detected. Details of the full sequence may then be 
determined. 

The sequence of the polymers is determined by the random 
incorporation of the monomers and not by the presence of any template 
5 molecule. Sequencing procedures are therefore not required, i.e. procedures 
requiring the use of the polymerase enzyme. 

The arrays of the invention are particularly suitable for analysis 
procedures where the spatially addressable polymers can be used to reveal 
information on an interacting molecule. For example, if the polymers are 
l o polynucleotides, the arrays may be used in hybridisation-based procedures, to 
reveal the sequence of target DNA which hybridises on the array. Uses of 
spatially addressed arrays are disclosed in WO-A-00/06770. 
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CLAIMS 

1. A method for forming a spatially addressable array of polymers 
immobilised on a solid support, comprising the steps of: 

(i) contacting an array of single molecules with one or more 
5 detectably labelled monomers, under conditions that permit 

incorporation of a monomer onto a molecule of the array, wherein 
the labelled monomer comprises a removable blocking group that 
prevents further monomer incorporation occurring; 

(ii) removing non-incorporated monomers and detecting the label on 
10 the incorporated monomer; 

(iii) removing the blocking group and any separate label; and 

(iv) optionally repeating steps (i) - (iii) to form a single polymer of 
defined sequence; 

wherein the array has a surface density which allows each polymer to be 
is individually resolved by optical microscopy. 

2. A method according to claim 1 , wherein the polymer is a polynucleotide, 
and the monomers are any of the bases A, C, T and G. 

3. A method according to claim 2, wherein each of the bases A, C, T and . 
G comprises a different label, and step (i) is carried out in the presence of all 

20 four bases. 

4. A method according to any preceding claim, wherein the label is a 
fluorophore. 

5. A method according to claim 4, wherein the label is detected using a 2-D 
fluorescent imaging device, a confocal fluorescence microscope or a CCD 

25 camera. 

6. A method according to claim 4 or claim 5, wherein the label is removed 
by photobleaching or by chemical or enzymatic cleavage. 

7. A method according to any preceding claim, wherein the array has a 
density of from 10 s to 10 9 polymers per cm 2 . 

30 8. A method according to claim 9, wherein the density is 10 7 to 10 8 
polymers per cm 2 . 
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9. A method according to any preceding claim, wherein the polymers are 
separated by a distance of at least 1 0Onm. 

1 0. A method according to claim 9, wherein the polymers are separated by 
a distance of at least 250nm. 
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